45 research outputs found

A methodology for determining amino-acid substitution matrices from set covers

Author: A. Bahr
A.D. McLachlan
D.F. Feng
G. Vogt
G.H. Gonnet
J. Setubal
J.D. Blake
J.K.M. Rao
M. Gribskov
M.F. Sagot
R.B. Russell
R.E. Green
R.F. Smith
S. Henikoff
S.A. Benner
T. Müller
T.P. Li
W.S.J. Valdar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/04/2005

We introduce a new methodology for the determination of amino-acid substitution matrices for use in the alignment of proteins. The new methodology is based on a pre-existing set cover on the set of residues and on the undirected graph that describes residue exchangeability given the set cover. For fixed functional forms indicating how to obtain edge weights from the set cover and, after that, substitution-matrix elements from weighted distances on the graph, the resulting substitution matrix can be checked for performance against some known set of reference alignments and for given gap costs. Finding the appropriate functional forms and gap costs can then be formulated as an optimization problem that seeks to maximize the performance of the substitution matrix on the reference alignment set. We give computational results on the BAliBASE suite using a genetic algorithm for optimization. Our results indicate that it is possible to obtain substitution matrices whose performance is either comparable to or surpasses that of several others, depending on the particular scenario under consideration.

arXiv.org e-Print Archive

Crossref

On merging the fields of neural networks and adaptive data structures to yield new pattern recognition methodologies

Author: B. Allen
B.J. Oommen
C.A. Astudillo
D.D. Sleator
D.E. Knuth
G.H. Gonnet
G.M. Adel'son-Vel'skii
K. Mehlhorn
R. Duda
R.P. Cheetham
S. Haykin
T. Kohonen
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2011

The aim of this talk is to explain a pioneering exploratory research endeavour that attempts to merge two completely different fields in Computer Science so as to yield very fascinating results. These are the well-established fields of Neural Networks (NNs) and Adaptive Data Structures (ADS). The field of NNs deals with the training and learning capabilities of a large number of neurons, each possessing minimal computational properties. On the other hand, the field of ADS concerns designing, implementing and analyzing data structures which adaptively change with time so as to optimize some access criteria. In this talk, we shall demonstrate how these fields can be merged, so that the neural elements are themselves linked together using a data structure. This structure can be a singly-linked or doubly-linked list, or even a Binary Search Tree (BST). While the results themselves are quite generic, we shall, as a prima facie case, present the results in which a Self-Organizing Map (SOM) with an underlying BST structure can be adaptively re-structured using conditional rotations. These rotations on the nodes of the tree are local and are performed in constant time, guaranteeing a decrease in the Weighted Path Length of the entire tree. As a result, the algorithm, referred to as the Tree-based Topology-Oriented SOM with Conditional Rotations (TTO-CONROT), converges in such a manner that the neurons are ultimately placed in the input space so as to represent its stochastic distribution. Besides, the neighborhood properties of the neurons suit the best BST that represents the data.

Crossref

NORA - Norwegian Open Research Archives

Agder University Research Archive

A grammar-based approach towards unifying hierarchical data models

Author: D. van Gucht
G.H. Gonnet
J. Paredaens
D. Janssens
M.L. Kersten
M. Gyssens
S.J. Thomas
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/1988

Crossref

Pure OAI Repository

Combining Pat-Trees and Signature Files for Query Evaluation in Document Databases

Author: A. Dessmark
A.V. Aho
C. Faloutsos
C. M
D. Harman
D.E. Knuth
D.R. Morrison
F. Bancilhon
G. Salton
G.H. Gonnet
I.A. Macleod
K. Böhm
S. Abiteboul
S. Christodoulakis
Y. Chen
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Compressed Suffix Arrays for Massive Data

Author: A. Crauser
F. Kulla
G. Navarro
G.H. Gonnet
H.-L. Chan
J. Kärkkäinen
J. Sirén
J.C. Na
M.I. Abouelhoda
N.J. Larsson
P. Elias
P. Ferragina
R. González
R. Grossi
S. Lee
S.J. Puglisi
V. Mäkinen
W.-K. Hon
Publication date: 01/01/2009

We present a fast space-efficient algorithm for constructing compressed suffix arrays (CSA). The algorithm requires O(n log n) time in the worst case, and only O(n) bits of extra space in addition to the CSA. As the basic step, we describe an algorithm for merging two CSAs. We show that the construction algorithm can be parallelized in a symmetric multiprocessor system, and discuss the possibility of a distributed implementation. We also describe a parallel implementation of the algorithm, capable of indexing several gigabytes per hour.

Crossref

Helsingin yliopiston digitaalinen arkisto

Recommended from our members

Parallel computing in information retrieval - An updated review

The progress of parallel computing in Information Retrieval (IR) is reviewed. In particular, we stress the importance of the motivation for using parallel computing in Text Retrieval. We analyse parallel IR systems using a classification due to Rasmussen [1] and describe some parallel IR systems. We give a description of the retrieval models used in parallel Information Processing. We describe areas of research which we believe are needed.

City Research Online

Crossref

Cache-Oblivious Hashing

Author: A. Aggarwal
A. Pagh
B. He
D.E. Knuth
E. Demaine
E. Verbin
G. Tenenbaum
G.H. Gonnet
G.S. Brodal
H.G. Mairson
J. Carter
J. Schmidt
Ke Yi
M. Frigo
M. Wegman
M.A. Bender
M.L. Fredman
M.S. Jensen
P. Afshani
P.-A. Larson
Qin Zhang
R. Fagin
R. Motwani
R. Pagh
Rasmus Pagh
W. Litwin
Z. Wei
Zhewei Wei
Publication venue: 'Springer Science and Business Media LLC'

Crossref

The analysis of a hashing scheme by the diagonal poisson transform

Author: D.E. Knuth
G.H. Gonnet
M. Abramowitz
O. Amble
P.V. Poblete
R.P. Brent
Publication venue: 'Springer Science and Business Media LLC'

Crossref

On the Lambert W Function

Author: D. J. Jeffrey
D.E. Knuth
D.E.G. Hare
G.H. Gonnet
R.M. Corless
Publication date: 01/01/1996

The Lambert W function is defined to be the multivalued inverse of the function $w \mapsto w e^{w}$. It has many applications in pure and applied mathematics, some of which are briefly described here. We present a new discussion of the complex branches of W, an asymptotic expansion valid for all branches, an efficient numerical procedure for evaluating the function to arbitrary precision, and a method for the symbolic integration of expressions containing W. 1. Introduction. In 1758, Lambert [47] solved the trinomial equation $x = q + x^{m}$ by giving a series development for x in powers of q. Later [48], he extended the series to give powers of x as well. Euler [28] transformed Lambert's equation into a more symmetric form by substituting $x^{-\beta}$ for x and setting $m = \alpha/\beta$ and $q = (\alpha - \beta)v$. His version of the equation was $x^{\alpha} - x^{\beta} = (\alpha - \beta)\, v\, x^{\alpha+\beta}$ (1.1), and his version of Lambert's series solution was $x^{n} = 1 + nv + \tfrac{1}{2}$ ...

CiteSeerX
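
As a rough illustration of the definition in the Lambert W abstract above (W as the inverse of w ↦ w e^w), the following minimal Python sketch evaluates the principal real branch for x ≥ 0 with plain Newton iteration. It is not the arbitrary-precision procedure the paper describes; the function name `lambert_w0`, the starting guess, and the tolerance are assumptions made only for this example.

```python
import math


def lambert_w0(x, tol=1e-12, max_iter=50):
    """Approximate the principal branch W0(x) for real x >= 0.

    Hypothetical helper for illustration: solves w * exp(w) = x
    by Newton's method on f(w) = w * exp(w) - x.
    """
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    # Starting guess (an assumption of this sketch): log(1 + x) lies at or
    # above the root for x >= 0, and f is convex there, so Newton's method
    # approaches the root from one side.
    w = math.log1p(x)
    for _ in range(max_iter):
        ew = math.exp(w)
        # Newton step: f(w) / f'(w), with f'(w) = exp(w) * (w + 1).
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + abs(w)):
            break
    return w


if __name__ == "__main__":
    for x in (0.0, 1.0, math.e, 10.0):
        w = lambert_w0(x)
        # The residual w * exp(w) - x should be close to zero.
        print(x, w, w * math.exp(w) - x)
```

Since e · e^1... more precisely 1 · e^1 = e, the call `lambert_w0(math.e)` should return a value very close to 1, which gives a quick sanity check of the inverse relationship.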